3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
1.2 GByte Production Status:
Newly created-finished
Use:
Named Entity Recognition
-
Paper title:Transforming Wikipedia into a Large-Scale Fine-Grained Entity Type Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Abbas Ghaddar | Université de Montréal | CA |
| Author 2 | Phillippe Langlais | Université de Montréal | CA |
| Main Contact | Abbas Ghaddar | Université de Montréal | None |
Documentation:
<Not Specified>
Not Applicable
Tagger/Parser,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
Connexor
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
sentence splitting, tokenisation, syntax analysis
-
Paper title:Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sanja Štajner | University of Wolverhampton | None |
| Author 2 | Ruslan Mitkov | University of Wolverhampton | None |
| Main Contact | Sanja Stajner | University of Wolverhampton | GB |
Documentation:
www.connexor.euLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
100 million words Production Status:
Existing-used
Use:
Measuring representativeness
-
Paper title:Using Word Familiarities and Word Associations to Measure Corpus Representativeness
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Reinhard Rapp | Aix-Marseille Université | FR |
| Main Contact | Reinhard Rapp | Aix-Marseille Université | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Sublanguage Corpus Analysis Toolkit: A tool for assessing the representativeness and sublanguage characteristics of corpora
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Irina Temnikova | Qatar Computing Research Institute | BG | Qatar Computing Research Institute, HBKU | QA |
| Author 2 | William A. Baumgartner Jr. | University of Colorado School of Medicine | US | U. Colorado School of Medicine | US |
| Author 3 | Negacy D. Hailu | University of Colorado School of Medicine | US | ||
| Author 4 | Ivelina Nikolova | Bulgarian Academy of Sciences | BG | ||
| Author 5 | Tony McEnery | Lancaster University | GB | ||
| Author 6 | Adam Kilgarriff | Lexical Computing Ltd. | GB | ||
| Author 7 | Galia Angelova | Bulgarian Academy of Sciences | BG | ||
| Author 8 | K. Bretonnel Cohen | University of Colorado School of Medicine | US | ||
| Main Contact | Irina Temnikova | Qatar Computing Research Institute, HBKU | None | Sofia University | None |
Documentation:
<Not Specified>
Written
Web Service,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Web Services
-
Paper title:Improving Cloze Test Performance of Language Learners Using Web N-Grams
-
Paper track:Applications
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Martin Potthast | Bauhaus-Universität Weimar | None | ||
| Author 2 | Matthias Hagen | Bauhaus-Universität Weimar | DE | ||
| Author 3 | Anna Beyer | Bauhaus-Universität Weimar | None | ||
| Author 4 | Benno Stein | <Not Specified> | None | Bauhaus-Universität Weimar | None |
| Main Contact | Matthias Hagen | Bauhaus-Universität Weimar | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
Dialogue
-
Paper title:Identification of Personal Information Shared in Chat-Oriented Dialogue
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Sarah Fillwock | Michigan State University | US | ||
| Author 2 | David Traum | University of Southern California Institute for Creative Technologies | US | USC ICT | US |
| Main Contact | Sarah Fillwock | Michigan State University | None |
Documentation:
The creation of English documentation is in progress.Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
196 KByte Production Status:
Newly created-finished
Use:
Named Entity Recognition
-
Paper title:Label Embedding for Zero-shot Fine-grained Named Entity Typing
-
Paper track:Morphology, Segmentation, Tagging, Chunking
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Yukun Ma | Nanyang Technological University | SG |
| Author 2 | Erik Cambria | Nanyang Technological University | N/A |
| Author 3 | SA GAO | Nanyang Technological University, Singapore | SG |
| Main Contact | Yukun Ma | Nanyang Technological University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English German
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Manual Analysis of Structurally Informed Reordering in German-English Machine Translation
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Teresa Herrmann | Karlsruhe Institute of Technology | DE |
| Author 2 | Jan Niehues | Karlsruhe Institute of Technology | NL |
| Author 3 | Alex Waibel | Karlsruhe Institute of Technology | DE |
| Main Contact | Teresa Herrmann | Fujitsu | None |
Documentation:
<Not Specified>
Written
Representation-Annotation Formalism/Guidelines,
Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Word Alignment
-
Paper title:On Complex Word Alignment Configurations
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Miriam Kaeshammer | University of Düsseldorf | DE |
| Author 2 | Anika Westburg | University of Düsseldorf | DE |
| Main Contact | Miriam Kaeshammer | University of Düsseldorf | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
Not Available
License:
Creative Commons
Size:
5 MByte Production Status:
Newly created-in progress
Use:
Dialogue
-
Paper title:The ADELE Corpus of Dyadic Social Text Conversations:Dialog Act Annotation with ISO 24617-2
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Emer Gilmartin | Trinity College Dublin | IE |
| Author 10 | Vincent Wade | ADAPT Centre Trinity College Dublin | IE |
| Author 2 | Christian Saam | ADAPT Centre, Trinity College Dublin | IE |
| Author 3 | Brendan Spillane | ADAPT Centre, Trinity College Dublin | IE |
| Author 4 | Maria O'Reilly | Trinity College Dublin | IE |
| Author 5 | Ketong Su | ADAPT Centre, Trinity College Dublin | IE |
| Author 6 | Arturo Calvo | Accenture Norway | NO |
| Author 7 | Loredana Cerrato | Eit Digital | SE |
| Author 8 | Killian Levacher | IBM | IE |
| Author 9 | Nick Campbell | Trinity College Dublin | IE |
| Main Contact | Emer Gilmartin | Trinity College Dublin | None |
Documentation:
Manual currently undergoing revision and validation




